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TO ALL WHOM IT MAY CONCERN: 

Be it known that I, Robert Beach, a citizen of the United States, whose 
post office address is 1850 Middleton Avenue, Los Altos, California 94204, have 
invented an improvement in 

APPARATUS AND METHOD FOR WIRELESS COMMUNICATION 

of which the following is a 

SPECIFICATION 

BACKGROUND OF THE INVENTION 

The present invention relates to wireless communication systems and 
particularly to systems which use wireless data communication. Existing systems for 
wireless local area network (WLAN) data communications include systems made 
according to IEEE Standard 802.1 1 wherein mobile units associate with an access point 
connected to a computer or a wired computer network in order to engage in wireless data 
communications. The assignee of the present invention provides one such system under 
the tradename Spectrum 24®. Another standard for shorter range wireless data 
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communications, for example between a computer and peripherals is the Bluetooth 
Standard, which is available at www.bluetooth.com. 

It is an object of the invention to provide improved methods and apparatus 
for wireless communications. 

5 SUMMARY OF THE INVENTION 

According to the invention there is provided a mobile unit or device which 
has a simplified and versatile construction. The device can be used to implement voice 
control of the mobile device as well as voice control of a remote computer in 
communication with the device. The mobile device can be combined with one or more 

10 additional units, such as a computer or a bar-code scanner, and function as a data 

processing and communications module for providing wireless data communications. 

In accordance with one aspect the invention there is provided a method for 
operating a system by voice; command comprising providing a mobile unit having a 
microphone, a digital signal processor and a radio module for providing wireless data 

1 5 communications to a computer. First voice commands having a limited vocabulary are 
received in the mobile unit and recognized using the digital signal processor of the 
mobile unit. The mobile unit is controlled in response to the first voice commands. 
Second voice commands are received in the mobile unit and converted to digital data 
signals which are sent using the radio module to a computer. The computer is operated to 

20 recognized the second voice command using a large vocabulary voice recognition 
program to derive computer control signals therefrom. 
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In a preferred arrangement of the method, the mobile unit is controlled in 
response to the first voice commands to communicate with the computer. The computer 
can be operated in response to the computer control signals derived from the second voice 
command. In one arrangement the second voice commands may be used to derive 
5 computer control signals which retrieve data from a memory of computer. The retrieved 
data can be converted into voice data and sent the mobile unit, where the voice data is 
converted to analog signals which are supplied to a speaker. In another alternate 
arrangement, the computer control signals are arranged to establish a voice 
communication channel between the mobile unit and at least one other voice 

10 communication device. The computer is operated to establish the voice communication 
channel to transfer voice communication data between the mobile unit and the other voice 
communication device. Establishing the voice communication channel may include 
converting the voice communication data between digital and analog form. 

According to another aspect of the invention there is provided a mobile 

15 device which includes a microphone for receiving sound signals, an interface connected 
to the microphone for converting received sound signals from the microphone to data 
signals and a radio module for sending wireless data communication signals. A digital 
signal processor is provided which includes a program for recognizing a limited number 
of digital data signals from the interface and operating in response thereto to control the 

20 radio, for operating the radio module to send digital data signals and for providing digital 
data signals corresponding to sounds from the microphone as data packets to the radio 
module. 
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In one arrangement the digital processor may be programmed to compress 
the digital data signals corresponding to sounds. The mobile device may include a 
speaker and the interface can be arranged for converting digital data signals representing 
sound signals and providing the sound signals to the speaker. The digital signal processor 
5 is further programmed to provide digital data signals received by the radio module to the 
interface for conversion to sound signals. Preferably the digital processor is programmed 
to compress the digital data signals representing sound signals from the microphone and 
to decompress digital data signals received by the radio. The mobile device may 
optionally have an interface whereby the digital processor is connected to a host 

10 processor such that data signals can be sent or received by the host processor using the 
radio module. The digital processor may optionally be interfaced to a bar code scanner 
for receiving bar code signals from the scanner and for converting the bar code signals to 
digital data signals. In a preferred arrangement the digital processor is programmed to 
receive data signals from the interface and to alternately supply the data signals to first 

15 and second buffer memories during alternating first and second time intervals using direct 
memory access. The processor is further programmed to process data in one of the data 
buffers while data signals are supplied to the other of the data buffers. The processing 
preferably includes use of a data compression algorithm and may additionally include 
voice echo cancellation processing. 

20 For a better understanding of the present invention, together with other 

and further objects, reference is made to the following description, taken in conjunction 
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with the accompanying drawings, and its scope will be pointed out in the appended 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a block diagram of an embodiment of a mobile device in 
5 accordance with the present invention. 

Figure 2 is a block diagram showing the use of the mobile device of 
Figure 1 in a wireless network. 

Figure 3 is a block diagram showing the connection of the Figure 1 mobile 
device to a host computer. 
10 Figure 4 is a block diagram illustrating the operation of the digital 

processor of the Figure 1 device in connection with converting sound signals into 
compressed data signals. 

Figure 5 is a block diagram showing the connection of the Figure 1 mobile 
device to a bar code scanner. 
15 Figure 6 is a block diagram showing the operation of the processor of the 

Figure 1 device in connection with processing received voice signals. 

Figure 7 is a block diagram showing the interface of the digital processor 
of the Figure 1 device to the RF portions of the device. 

Figure 8 is a block diagram showing the operation of the processor of the 
20 Figure 1 device in connection with RF transmission of data packets. 
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Figure 9 is a block diagram showing the operation of the digital processor 
of the Figure 1 device in connection with the reception of data packets. 

D ESCRIPTION OF THE INVENTION 
Referring to Figure 1 there is shown a block diagram of a mobile device 
5 10 in accordance with an embodiment of the present invention. The device 1 includes a 
microphone 12 and a speaker or earphone 14 which are coupled respectively to provide 
and receive audio analog signals to and from analog processor 16. An audio Codex 18 is 
provided for respectively converting audio signals from microphone 12 into digital data 
signals corresponding thereto, and for converting digital data signals representing sounds 

10 into analog signals to be provided to speaker 14. Codex 18 is interfaced to a first serial 
port of a processor 20, which in a preferred arrangement is a Texas Instrument 5409 
digital signal processor (DSP). Those skilled in the art will recognized that other 
processors may be used for purposes of carrying out the invention, but the 5409 DSP is 
considered to have particular advantages in connection with its speed of operation, 

1 5 modest power consumption and other capabilities, including its ability to interface with 
other devices via serial ports and a data bus, as will be described. A second serial port of 
processor 20 is interfaced to RF baseband processor 22 which is coupled to RF analog 
section 24. Processor 22 and analog circuit 24 provide in a preferred embodiment for the 
transmission and reception of digital data signals following the 802.1 1 protocol. Signals 

20 are sent and received by am:enna 26. The mobile device 10 and the method of the present 
invention can be applied to other wireless data communications protocols, such as 
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Bluetooth, the device, method and system will be described in connection with an 802.1 1 
system. Those familiar with the art can reconfigure the device and method for 
compatibility with other systems. 

In one arrangement the mobile device 10 of Figure 1 may be provided as a 
5 hand held "communicator" device which is capable of providing limited on board speech 
recognition for purposes of controlling device 10, and for communicating by digital data 
signals with a remote computer for purposes of providing voice commands to the remote 
computer. In this arrangement, it becomes possible for a person using mobile device 10 
to establish a connection to the computer over an 802.1 1 local area network using first 

10 limited vocabulary voice commands. Once a connection is established, the user can give 
instructions to the computer for performing tasks by transmission of second voice 
commands from mobile device 10 to the computer, through an access point connected to 
the remote computer. For example, a user could instruct device 10 to access the 
computer by a command of a single word or short phrase, such as "computer on". The 

1 5 device 10 would recognize the first voice command "computer on" and be prepared to 
transfer the following second voice commands from the user to the computer. The 
digital processor of mobile unit 10 may, for example, place the device in active mode and 
provide an output voice signal to indicate the command has been understood, such as 
"Ready". 

20 The user may then give a further second command for the computer, such 

as "inventory". In response to the command "inventory" the mobile device 10 would 
convert the command to a digital data signal and send the command to the computer 

NY02-283314 1 "7- 



A33366-072797.0129 



using the radio module. The computer would perform a recognition on the word 
"inventory" using a large vocabulary voice recognition system and realize that the user of 
mobile device 10 wanted information from an inventory maintained in the computer 
system. 

5 The computer then might generate a data signal representing a response to 

the term "inventory" such as "what part number?". This data signal would be sent to 
mobile device 10 over the wireless local area network and be provided to the speaker 14. 
Responding to the computer's request for a part number, the user could give a sequence 
of numbers by voice, which is again relayed to the computer, provided to the large 

1 0 vocabulary voice recognition system, and cause the computer to look up the part number 
in an inventory database and retrieve information concerning the availability of that part 
in the storeroom and its location. Again this information can then be converted to voice 
data format and sent to mobile device 10 as digitally encoded voice signals, which are 
provided by the receiver to speaker 14. 

1 5 Figure 2 shows an example of a system arrangement in which the mobile 

device 10 of the present invention can be used. As shown in Figure 2 there is provided a 
computer 15, which may be a server system, for example, running on a UNIX or other 
operating system capable of supporting a large vocabulary voice recognition system. 
Computer 15 is connected to access point 17 which conducts wireless communication 

20 with mobile device 1 0, using 802. 1 1 protocol or other wireless protocol. Computer 1 5 
may also be connected to other peripheral devices, such as a digital PBX 19 or a printer 
23 . Another mobile unit 2 1 may also be provided for conducting 802. 1 1 or other wireless 
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communication with computer 15 through access point 10 or other access points. 
Alternatively, mobile unit 2 L can provide wireless communication through access point 
17 with mobile device 10. In one environment mobile device 10 is a highly simplified 
device which may include only one or a very small number of keys for operation thereof 
5 In one embodiment, for example, only an on/off key may be provided. In other 

configurations, more than one key may be provided for certain functions such as volume 
up/down or to cause the device to "wake up" from a power saving mode. It is intended 
that for some applications the device 10 is primarily operated by voice command, and 
that the voice commands be interpreted in the device itself, for a very limited vocabulary 

10 of commands, or in computer 15 for a larger vocabulary of commands. Mobile device 10 
can be used to control the operation of computer 15 by voice command, or alternatively, 
may be used for communications, such as telephone communication or walkie-talkie type 
communications with one or more other mobile units 21, which may be similar to mobile 
device 10, or with other telephone devices through PBX 19. 

15 It should be recognized that the second voice commands which are subject 

to interpretation in computer 15 may be interpreted as control instructions for operation 
of computer 15 and its peripheral devices, such as printer 23, but can additionally be used 
to provide control signals which are sent back to processor 20 in mobile device 10 as 
operational instructions. For example, a user may give the command "volume up" or 

20 "volume down". Such instructions can be interpreted with the limited vocabulary of 
programming in processor 1 0, or may be sent as voice representative data signals to be 
interpreted in computer 15. In the later case computer 15, having recognized the 
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instruction creates a corresponding data message which is sent as a control signal over the 
wireless network to be implemented by processor 20 in mobile device 10. 

In a preferred arrangement, mobile device 10 carries on a method whereby 
the unit itself and/or computer 15 may be controlled using voice commands. According 

5 to a preferred arrangement of the invention mobile device 10 is provided with 

programming in DSP 20 that recognizes one or a very small number of voice commands 
provided to microphone 12 by the user. Additional commands are provided by digital 
communication of voice data from mobile device 10 to access point 17 and thereafter to 
computer 15 which has a lar ger vocabulary voice recognition system. 

10 The mobile device 10 can be used as a stand alone unit as described above 

for providing voice command and voice communications function. In addition the 
mobile device 1 0 can be configured as a part of a larger portable device, operating as a 
communication module. 

One example of the application of mobile device 10 is shown in Figure 3, 

1 5 wherein the device is interfaced via the host part of the DSP20 to the system bus of a 
portable computer, point of sale device or digital personnel assistant device. The 
configuration of Figure 3 includes a host CPU38, RAM40, flash memory 42, display 46 
and possibly other peripherals 44 such as a barcode scanner. In the Figure 3 
configuration the mobile device 10 provides the addition function of providing wireless 

20 data communication capability to host CPU 38 so that the operator can send and receive 
data over the WLAN from and to the CPU 38, which may, for example be displayed for 
the user on display 46. 
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Another example of application of mobile device 10 is shown in Figure 5, 
wherein the mobile unit is interfaced to bar-code scanner 90 to provide processing and 
wireless communication of bar code signals from scanner 90. Thus when a user of the 
portable device scans a bar code, the code signals can be processed in DSP20 and sent as 
5 data signals to a host processor. In the Figure 5 configuration the bar code scanner 90 is 
interfaced to the 16 bit bus of DSP20. 

A significant application of mobile device 10 in either a stand alone 
configuration or when combined with other devices is to provide voice communication 
functions. In a first operating mode, telephone communications can be provided, 
10 preferably by providing a PBX 1 9 connected to the wired network which includes 

computer 15 and access point 17, as shown in Figure 2. PBX 19 includes an interface to 
provide conversion of voice representative digital signals into voice signals and vice 
versa. In addition, the interface may provide conversion of digital address signals into 
DTMF signals for operating PBX 19 to access a telephone extension or external number. 
1 5 The address signals may be provided to the PBX interface by computer 1 5 in response to 
voice commands received from mobile device 10 via access point 17. 

In connection with telephone service mobile device 10 operates to send 
and receive voice representative data packets which are converted into voice signals, and 
vice versa, and may operate in full duplex mode. In one embodiment a user may 
20 establish a telephone connection as follows: 
User: "Computer On" 
Response: "Computer Ready" 
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User: 'Telephone" 
Response: "Who do you want to call? 
User: "John Cook" 
Response: "Calling John Cook" 
5 At this point Computer 15 retrieves the telephone or extension number for 

John Cook and sends a message to the interface of PBX 19 which establishes the call and 
converts voice-data packets to voice signals for the telephone communication. 

To end the telephone call the user may invoke the assistance of computer 
by commanding "Computer on", which is recognized by DSP20 to establish a voice link 
10 to Computer 15. This may be followed by a user-voice command, such as "End call" or 
"Hang Up" which is received and executed by Computer 15. Alternatively, where the 
local voice recognition program is not operating during a telephone call, the device may 
be provided with a command button that places the device in voice recognition mode, or 
that interrupts the telephone communication to send voice commands to Computer 15. 
15 Depressing the button cause mobile device 10 to send the voice message to the 

recognition routine of computer 15 rather than send the voice message as part of a 
telephone conversation. 

In the case where a command button is provided, it may be used to bring 
device 10 out of a power saving mode and into a voice command mode. In this 
20 embodiment the voice recognition software in mobile device 10 does not need to be 
constantly running. 

-12- 
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It should be recognized that a call can also be placed by giving a 
telephone or extension number by voice command. 

In addition to regular telephone-connection via PBX 19, the mobile device 
may also be used to establish telephone like voice communications to other mobile units 

5 21 in the system which have the same or similar voice communications arrangement. For 
example in response to "Who do you want to call?" a user may respond "Manager". 
Computer 15 would then establish a voice connection to mobile unit 21, which is carried 
by the manager on duty. This voice connection is established, for example by computer 
15 sending out data messages to mobile device 10 giving the IP address of the Manager's 

1 0 mobile unit 2 1 , and also sending a data message to mobile unit 2 1 , signaling the 

incoming call from mobile device 10 and possibly also sending the IP address of mobile 
device 10. The latter can also be supplied by a message from mobile device 10. 
Communication of voice representative data between mobile device 10 and mobile unit 
21 can proceed via the network for example through access point 17 or via the wired 

1 5 network to a different access point serving mobile unit 21 . 

It will be recognized that Computer 15 can likewise set up "conference 
calls" among mobile units, telephone extensions, outside telephone numbers and mobile 
devices, preferably using voice command and PBX 19 and AP17. Further, units can be 
linked in a walkie-talkie mode, wherein only half duplex communication is provided. 

20 Further, a user of mobile unit 10 may command the computer to connect the unit for a 
broadcast to all or a group of voice capable mobile units, for example members of a 
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security force, or to connect the mobile device to a loudspeaker system for a public 
announcement. 

The mobile device 10 of the invention may additionally be used in voice 
applications that do not involve voice communications. For example, mobile device 10 
5 may be used as an interactive guide device, for example in a museum or on a self guided 
tour. The user may indicate to a computer 15 a location or a work of art that is being 
viewed and receive a description concerning the subject. The description can be in any 
selected available language, and in variable detail. For example, a tourist may only want 
basic information about a work of art, while a student may want more detailed 
10 information. This may be provided when the user responds "yes" in response to a prompt 
"Do you want more information?" The information is stored in computer 15, either in 
digital voice format or in alphanumeric format which is provided to a voice generating 
program. 

Another application of mobile device 10 is to provide voice readout of e- 
15 mail messages. The user of mobile device 10 can access an e-mail account. Newly 
received e-mails can be provided to a voice synthesizer that "reads" the received 
messages upon request. 

Another application of mobile device 10 is to provide music or other 
playback from digital recordings stored in the memory of computer 15, for example under 
20 MP3 format. The user of mobile device 10 can select one or more recordings to listen to 
by voice command. 
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Figure 4 is a block diagram showing the interface of digital processor 20 
to audio Codex 18. Audio Codex 18 is connected to serial port 40 of digital signal 
processor 20. The programing of digital signal processor 20 is arranged to cause data 
provided at serial port 40 via a DMA memory access to a first, second and possibly more 

5 buffer memories 42, 44 within the memory 46 of digital signal processor 20. In particular 
an analog audio signal is converted into a digital signal, for example, a 14 or 16 bit 
signal, in audio Codex 18 and clocked through serial port 40 through a DMA channel into 
a first buffer 42 in memory 46 of digital signal processor 20. The sampling rate may be, 
for example, 64 K bits/sec and may be varied to obtain higher or lower quality audio 

10 sampling. For example, a lower sampling rate may be used for voice communications, 
such as telephone-like communications, on a higher sampling rate may be used for voice 
input, such as the first or second commands, that must be recognized by a voice 
recognition program. The samples are provided to buffer 42 which has a capacity to hold 
10 ms worth of samples. WTien buffer 42 is filled the Interrupt Service Routine (ISR) 

15 causes the DMA channel to automatically switch to a second buffer 44 and continuing to 
provide the voice data to buffer 44 while the digital signal processor processes the data in 
buffer 42. Voice processing will typically take 1 ms to perform 729 compression and 
perhaps another millisecond for other activities, such as echo cancellation and 
packetization of the digital audio signal. After the second buffer 44 is filled the voice 

20 data is provided to the first buffer 42 while the data in buffer 44 is processed by the 
digital signal processor. 
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Figure 7 shows the interface of the digital signal processor 20 to the radio 
frequency portions of the mobile device 10 for transmission of a voice or data pocket. 
Serial port 50 of digital signal processor 20 is connected for data transfer through ASIC 
58 which is connected to the base band processor 60 of the radio portion of the device. 
5 Base band processor 60 is likewise connected to RF analog section 62. The base band 
processor may, for example be an Intersil 3861 unit. The RF analog section may be the 
same as those used in current Spectum 24 products. A second serial interface 52 of 
digital signal processor 20 is connected to provide signals from the digital signal 
processor to the command port of the base band processor 60. In addition control and 
10 status signals are provided between DSP 20 and BBP 60 on data bus 54 and written to 16 
bit latch 56 for providing signals to the command ports of base band processor 60 and 
analog circuit 62 in connection with the transmission and reception of data packets by 
mobile device 10. 

Referring to Figure 8 the a voice representative data packet pocket or data 
15 packet to be transmitted is stored in transmit buffer 64 in memory 46 of DSP 20. The 
data packet may be either a data signal or a signal consisting of digitized, and preferably 
compressed, voice data. DSP 20 is programmed to provide a DMA channel to serial port 
50 under control of the ISR by which the data to be transmitted is clocked into base band 
processor 60. Prior to the transfer of the data packet, the header and other overhead data 
20 are added to the data packet by processor 20. The CRC of the 802. 1 1 signal format may 
be provided in ASIC 58 wherein the CRC data is computed and added to the data packet 
as it is provided to base band processor 60. 
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Figure 9 is a block diagram illustrating the operation of digital signal 
processor 20 in connection with receiving a data packet from the RF module, which may 
be either data or voice encoded data. The received signal packet is provided to serial port 
50 from base band processor 60 and provided by DMA access to a buffer under control of 

5 the ISR, having a first buffer portion 70 for receiving the PLCP header portion of the 

802.1 1 packet, for example, and a second buffer portion 72 for receiving the remainder of 
the packet. Referring to Figure 6 the received data packet, if it consists of encoded voice 
signals is provided from buffer 72 to jitter buffer 74 which compensates for the 
irregularities in time during which the voice signal sample is received over the wireless 

10 local area network. The digital signal processor 20 performs voice decompression and 
places the decompressed sound digital signals into two or more buffers such as buffer 76 
and buffer 78 which are read out to serial port 40 in alternating sequence under ISR 
control, and provided to Codex 18 for the generation of audio signals to be provided to 
speaker 14. 

15 In a preferred embodiment of the mobile device 10 two very real time 

processes, RF and voice, share the same CPU without interfering with one another. A 
key to understanding how this is done is understanding that the two processes are actually 
quite complementary and so share the processor with only a few minor accommodations. 

The voice process consists of two basic activities: one very lightweight 

20 and very real time, and the other rather heavyweight but only very moderately real time. 
The first activity has already been described, and consists of managing the DMA channel, 
Serial Port and buffers that transfer voice data to/and from the audio Codex. The transfer 

-17- 
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rate and amount of data is relatively low and moves at a constant rate. Every 10 ms 
enough data has moved belween the DSP and Codex to require processing by the DSP. 
The actual transfer is handled by DMA and ISR and involves only very quick and simple 
buffer flips. The actual buffer flipping is handled by the DMA controller and so the ISRs 

5 can be a little late since they will be setting up the "next buffer." This is a very common 
double buffering trick. One might even add a third buffer to give us even more "jitter" 
time in servicing the DMA interrupts. 

The other major voice process is the voice compression and echo 
cancellation processing. These are the "heavy" routines, especially compression. The 

10 G.729 code consumes about 10 MIPs on the average to compress 10 ms of voice samples 
into a 16 byte voice "cell". Echo cancellation consumes another MIP or two. Given a 100 
MIPs processor (such as the Tl 5409), this is about 10-15% (10 - 12 MIPS out of 100) of 
the CPU capacity. Hence every 10 ms, the CPU must spend 1-2 ms doing compression 
and echo cancellation. Once started, this processing cannot be halted, although it can be 

15 interrupted without much problem. This is an important concept since even 1 ms is a 

long time for the RF process (one might receive 2 -3 packets of data on the RF channel at 
1 1 Mbits/sec in that time). The key timing goal for the voice processing routines is to 
complete the 1-2 ms of processing before the next 10ms interval begins. In other words, 
one must complete processing the last 10ms of samples before the next 10 ms of samples 

20 is completed. One really lias 1 8 ms to complete the 1ms, but one will then have to do 2 
sets of samples in the next 4ms (and that might be hard depending on what else is going 
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on). One might add a third buffer and so give us 30 ms to complete 6 ms of processing 
but any more buffers beyond that and one begins to impact latency. 

The decompression process utilizes much less CPU resources, perhaps 1-2 

MIPS. 

The RF processing is in many ways quite different than the voice 
processing. It consists of two basic activities: a moderately lightweight but very real time 
process and a moderately lightweight but very loosely realtime process. The first process 
consists of the actual packe t transmit and receive routines. The transmit process is the 
least real time of the two if one ignores the actual data transfer element (which is handled 
by the DMA controller and does not utilize any CPU resources except for setup and 
completion). The process of the building the transmit packet, including Wire Equivalent 
Privacy (WEP), can be done at the same processing "level" as the voice compression 
process (i.e. at "task" level rather than at "interrupt" level). For voice packets, it could 
well be considered part of the voice processing task. The only realtime processing 
element of the transmit process consists of monitoring the channel waiting for a time to 
transmit. Once the data starts moving, the CPU is out of the loop until the DMA 
completes. At that time, the realtime element associated with the transmit process is 
waiting for an ACK (if required). The ACK will appear within a few dozen 
microseconds or not at all. 

The receive process is more realtime since it interacts more closely with 
the BBP, which does not include significant intelligence, and is demanding in terms of 
communication. As noted in earlier sections, the receive process consists of fielding two 

-19- 
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DMA channel interrupts with a little processing after each. The end of packet processing 
of receive packets is a very hard real time task since the goodness of the packet must be 
verified using the CRC data and an ACK readied (if needed) within a slot time of 10- 
20/^s. The ACK itself is trivial to build and send but it must go out on time. 

The other major element of the RF process (for an 802.1 1 System) is the 
association/roaming activity. Most of the time this activity takes place on a longer time 
scale than anything else on the system. The decision to roam and associate with an AP 
takes place over periods of hundreds of ms to seconds. The actual process of associating 
is very quick (a 2 packet exchange of very short duration), but the decision to roam and 
the discovery of other APs is more a background activity than anything else. There are 
exceptions (like when one cannot find any AP), but such activity clearly takes precedence 
over all other activities (except keeping the speaker side of the code fed with at least 
silence). 

This section outlines the framework for the software running on the DFC. 
There is a need for multiple, timers in this model. They are needed for a variety of 
purposes such as: 

Collision avoidance backoff and packet retries for the RF Tx 
process 

Loss of packet clocking on RF receive 
Roaming and associating timing 
There is only a single timer on the Tl 5409 DSP and so it may be 
necessary to either multiplex the times for these purposes, or add additional external 
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timers added. Associated with the timers are ISRs that are involved in changing state of 
the RF process. 

The concept of a simple "main loop" that handles the voice processing 

routines and the association/roaming process is key to the DFC architecture. It should be 

5 noted that in the following discussion, the term "process" is frequently used. In this 

context, "process" simply means "activity" and not a separate thread of processing such 

as one might find in a multitasking kernel. This loop is a simple polling loop that 

responds to event flags and. buffer changes created by the DMA channels and their 

respective ISRs. There are 5 major activities in this loop: 

10 • Voice Processing 

RF Processing 
Data transmission/reception 
Configuration and Control 
User Interface (optional) 

1 5 The voice processing element of the loop is very simple: 

(1) Wait for a signal from the voice receive ISR that the DMA has 
filled up a buffer of voice samples. When the signal is found, 
invoke the voice compression and echo cancellation routines. 
These will occupy the "main loop" process for 1-2 ms. 

20 (2) Wai t for 2 voice cells (20 ms) of voice to become ready and then 

prepare a RTP/UDP/IP/802.1 1 header set. If WEP is required, then 
do so at this point. The header creation is trivial but the WEP 
requires about 400ns/byte and so a 100 byte packet would require 
40/^s, a minor computational burden. When this is done, start the 

25 RF transmit process. 

(3) Wait for a signal from the voice ISR that the DMA has emptied a 
buffer of voice data, and so it is time to take a 16 byte voice cell 
from the jitter buffer and decompress it into voice samples. This 
wili occupy the voice loop for 200-300/^s. 
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The RF side of the main loop consists of two elements: 

Extracting voice cells from packet 
Association and Roaming 

The first task is extracting voice cells from received packets and placing 

5 them in the jitter buffer. This process involves working through the various headers and 
getting down to the voice cell. This is a minor parsing and copying process. 

The association and roaming process has several states that define its 
priority relative to the rest of the system. The initial and most important state is "Not 
associated". In this state the MU is activity looking for an AP. This involves sending 

10 probe packets and waiting for replies. For the mobile device 10, this is about the most 
important activity and when in this state can generally ignore voice processing entirely 
(making sure that the speaker output is muted). If associated then the 
association/roaming process operates as a background activity, collecting information on 
error rates and signal strength as well as looking for other APs. Again this involves 

1 5 generating occasional probe packets and waiting for replies. The computational and 
network load for this activity is minor. 

The Data Transmission and Reception part of the loop deal with the 
transmission and reception of nonvoice data packets. Even in a voice centric 
environment there is a certain amount of data transmission and reception. The mobile 

20 device 10 must communicate with the voice server in computer 17 to determine how it 

should operate. In addition, when the mobile device 10 can be used as an RF module in a 



NY02:283314 1 



-22- 



A33366-072797.0129 



larger terminal, as shown in Figure 3, so it must be capable of providing a WLAN data 
interface to that terminal. 

The data transmission process (DTP) involves examining a single buffer to 
determine if there is anything to send. A host processor 38 or processor 20 process places 

5 data in this buffer and waits for it to be sent. The data my be data being communicated 
by host processor 38, bar code data processed by processor 20 or "overhead" control data 
messages originated by processor 20, such as association/roaming messages and ACK 
signals. The request and completion state is indicated by a simple state variable. If the 
buffer is full, then the applications simply wait for it to become empty. Once a buffer has 

1 0 been filled and recognized by the data transmit process, an 802. 1 1 header will be added to 
it in space reserved for it at the beginning of the buffer. If WEP is required, then it will 
be performed at the 400ns/byte rate. A 1500 byte buffer would require 600^s to encrypt. 
Once this is completed, DTP will call the appropriate routines will be called by DTP to 
schedule the transmission. If a transmission is already in process, the DTP will exit and 

1 5 check back again later. When the transmission is complete, the buffer will be freed by - 
the RF transmit ISR. 

The Data Reception Process (DRP) really begins in the RF reception 
process mentioned earlier. That process receives all packets from the RF receive ISR and 
parses them looking for voice cells. If the packet is not a voice cell, then the packet is 

20 passed to the DRP. The "passing" may be as simple as marking the buffer for the DRP to 
look at and ceasing any further processing on it. The DRP will examine it and determine 
if it some the processor 20 cares about. If not, then it will be passed to the host processor 
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38, bar-code reader 90 or another peripheral device. If the packet is not for any device 
associated with mobile device 10, it will be ignored. 

The Comiratnd and Configuration Process (CCP) is primarily concerned 
with determining the destination (and source) of voice streams. This involves supplying 

5 IP and MAC addresses as well as a code to the voice processing routines. There are a 
variety of mechanisms to do this. If the mobile device 10 is part of a larger terminal, then 
it will likely come via a configuration command (or something as simple as writing the 
data into a well known location). If the mobile device 10 is in communication with a 
voice server in computer 15, then the information will come over the network via a data 

10 packet (the contents of which will be written into those well known locations). 

The final element of the main loop is the User Interface Process (UIP). 
This is really a placeholder for UI code that may be present in mobile device 10. It would 
involve looking for key presses and acting accordingly. This activity may involve a local 
activity like changing the volume or a global activity like changing the destination of the 

1 5 voice stream from another voice system to the voice recognition server. 

The overall model is very simple. It involves a "task" level polling loop 
that is essentially a non preemptive scheduler. Activities at this level are started as a 
result of events at interrupt level. Once started, an activity runs until it finishes or 
otherwise yields control of the processor. Of the various activities at this level, there is 

20 one "big" one which is voice compression. It may run for 1-2 ms at a time. All of the 
other activities are much shorter term. 
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The presence of the voice compression code and its long processing time 
means that the RF packet send/receive process must run almost entirely at interrupt level. 
This means the main loop code or other routines, such as voice compression, will be 
frequently interrupted but so long as the main loop gets 20% of the CPU cycles, 
5 everything will work fine. The RF interrupt processing will consume far less that 80% of 
the CPU, even at 100% network utilization. 

Most packet transmissions originate at the main loop level (the only 
exception are the ACKs that are generated in the RF Receive ISR). At that level, the 
802.1 1 packet is created and prepared for transmission. A call is made to a subroutine 

10 that will start the transmission process and take "ownership" of the buffer. The actual 
transmission will start sometime later in an ISR. Likewise the completion action will 
occur at ISR. The same applies for retransmissions. There are basically two transmit 
buffers: one for priority things (mostly management frames) and one for non priority 
things (including voice traffic). 

15 The RF reception process runs entirely at ISR. Packets are received in the 

two step process mentioned earlier. First the DMA transfers the PLCP header to memory 
and generates an interrupt. This allows the CPU to parse the header and setup for to 
transfer the rest of the packet using DMA. There will be 2 or 3 fixed sized packet buffers 
(different buffers may have different sizes and so we can place the packet in the optimally 

20 sized buffer). The receive process stays in the ISR long enough to receive enough of the 
packet to verify that it is for the client. This will be no more than 100/iS, worst case. 
Another interrupt will occur when the packet ends at which time the ISR will decide 
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whether to generate at ACK or not. The receive ISR will format and send the ACK, 
bypassing the RF Transmit ISR entirely. The receive packet buffer is passed to the main 
loop via an event flag and a buffer pointer. 

The main loop parses received packets and determines what to do with 

5 them. WEP decryption is done at the main loop level (again at the 400ns/byte rate). 
Packets may contain voice cells that go into the jitter buffer, data packets for the 
processor 20 or user application, or 802.1 1 management frames for the 
association/roaming process. 

As noted in earlier sections, most of the information presented so far is 

10 focused on the "Active Voice Transfer" mode of operation, which is essentially the 

mobile device 10 acting as a vehicle for full duplex voice conversations. There are other 
modes of operation and this section will describe those modes and how they work. 

A key concept here is that the other modes are generally different software 
modules that are loaded into the onchip memory of the DSP from external memory. This 

1 5 is because the amount of on chip memory for the 5409 is limited to 64KB. Execution out 
of off chip memory is both extremely slow as well as costly from a power consumption 
basis (executing in off chip memory doubles the power consumption of the DSP). For 
terminal devices, the external memory will be an external flash memory. For embedded 
mobile device 10, the source of will be the host process. This process will occur very 

20 quickly even for the flash memory case. The transfers will typically go at 20MB/sec (16 
bits every 100ns from the flash memory) and so 20KB of code would take only a 1-2 ms. 
All of the transitions between the various operating modes take place on much larger time 
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frames than a few milliseconds. It is likely that there will be a certain amount of shared 
data bases and data structures that most, if not all, the modes use. Items like the AP table, 
voice buffers, ESS and BSS information, etc. are likely to be shared across multiple 
modes. 

The following are some other operating modes for the mobile device 10. 
There may be other modes that develop over time. Furthermore, simply because a 
separate mode is identified does not mean that it may not be implemented as a submode 
within another mode. 

Initialization and Diagnostics - This mode is the initial startup 
state in which the mobile device 10 performs hardware and 
software initialization. It may also perform some basic poweron 
diagnostics 

Local Voice Recognition/RF PSP - This mode is the normal sleep 
state for the mobile device 10. It is waking up for beacons from 
the AP and it is looking for certain "magic words" as input using 
the embedded small vocabulary speech recognition program. A 
key press may also wake the device up as may the reception of a 
packet from the network. 

Non Voice RF CAM - In this mode, the unit may be actively 
transmitting data packets but is not processing a voice stream. 
This mode may be frequently used when the mobile device 10 is 
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inside a terminal product and the terminal is doing data only 
activities. 

• Voice/RF PSP - This mode supports an intercom/walkie talkie 

mode of operation in which an added latency is acceptable because 
5 the voice traffic is not full duplex as it would be in a telephone 

type call. 

Remote Voice Recognition/RF CAM - This mode is a variant of 
the Active Voice Transfer mode in which the goal is to offer the 
remote voice recognition server the best voice quality possible, 
10 even if it consumes additional network resources. The rationale is 

that the duration of such modes is short and good recognition 
performance is a key. 
The Initialization and Diagnostics is basically the startup mode of the 
mobile device 10. It runs some basic poweron diagnostics, initializes hardware and 
1 5 software, and then loads the real runtime code. 

The Local Voice Recognition mode of the mobile device 10 runs at a 
greatly very reduced instruction execution rate (1-2 MIPS rather than 100 MIPS) and so 
power consumption is greatly reduced. It is not likely to be halted entirely since it 
consumes much less power when simply slowed down. In this mode, it keeps a timer 
20 running for waking up the RF section for brief times to receive beacons. The rest of the 
time the RF section is entirely powered down and the serial ports/DMA channels 
connected to the BBP are also halted. The other activity in this mode is looking for local 
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wake up signals of which there may be at least two. The first is a key press. The second 
is a spoken "magic" word. To do this, the mobile device 10 keeps the audio input 
code/serial port/dma engine running, dumping samples into memory. Every 10 ms or so, 
a voice recognition routine; is invoked to process the received audio samples, looking for 
5 the magic word. This process has a very low demand on CPU resources and so the 

DSP 20 can be slowed down a lot. During this time the output audio system is also shut 
down since there is nothing to listen to. 

The NonVoice RF CAM mode is the active voice transfer mode without 
any voice processing going on. The entire audio section is shut down including the code, 

10 serial ports, and DMA channels. It is conceivable that the DSP may also be slowed down 
in this state since the only real activity is handling the RF interface. 

In the Voice/RF PSP mode of operation the radio is operating in PSP 
mode but there is an active voice channel. The DSP operates at full rate with the audio 
section active, but the RF section is generally powered down. It is woken up every 

15 100 ms or so to receive (or send) the next voice packet. The number of voice cells/packet 
is increased from 2 or 3 to perhaps as many as 10. 

The Remote Voice Recognition/RF CAM mode is a variant on the active 
voice transfer mode in which a higher quality voice sample is sent briefly to the server 
17. The improvement is only from the mobile device 10 to the server 17. The server 17 

20 to mobile device 10 voice channel may utilize a more compressed voice channel. It may 
also be disabled entirely. There are several options for the voice processing required in 
this mode. The simplest is to simply increase the sampling rate and send the data as 16 
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bit values. This involves virtually no processing. Another approach is run feature 
extraction algorithms on the data and sent the processed results to the voice server. In 
any case both the RF and voice sections are active just like they are in active voice 
transfer mode. 

In a further alternate embodiment illustrated in Fig. 5 a barcode scanner 90 
is connected to the 16 bit bus of the DSP 20. There is clearly sufficient CPU cycles 
available on the DSP 20 to handle the decode process of a one dimensional barcode (there 
might even be enough for a two dimensional code). 

The operation of a bar code decode engine is very similar to voice 
compression. One gets a sample of digitized analog data on a regular basis and then one 
processes it looking for interesting patterns. The process repeats until the decode has 
been completed or the user gives up. On a 100 scans/sec engine, there is a sample of data 
every 10 ms (just like voice samples). 

In such an integrated model, the user would need to make a choice 
between scanning and talking. When the scanner decode was in process, the speaker 
would go quiet and no voice data would be accepted by the code. Of course if no voice 
transfer was in process at Ihe time, then there would be nothing to shutdown. The RF 
would continue to operate much in the same way as voice processing and RF processing 
operate. 

The biggest issue could well be the code size of the decode software. 
Trying to run it in the onchip memory of the DSP 20 could well be impossible given the 
other activities going on. It is possible to add some external SRAM to the DSP into 
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which the decode software could be placed. Executing out of that SRAM would incur a 
performance penalty and power consumption would increase. On the other hand, even at 
a reduced performance level the DSP could still run the decode algorithm and the 
increased power consumption (and cost) would still be less than with a separate decode 
5 processor. 

While there have been described what are believed to be the preferred 
embodiments of the invent ion, those skilled in the art will recognize that other and further 
modifications may be made thereto without departing from the spirit of the invention, and 
it is intended to claim all such changes and modifications as fall within the true scope of 
10 the invention. 



NY02:283314.1 



-31- 



A33366-072797.0129 



I CLAIM : 



1 1 . A method for operating a system by voice command, comprising: 

2 providing a mobile unit having a microphone, a digital signal 

3 processor and a radio module for providing wireless data communications to a computer; 

4 receiving first voice commands having a limited vocabulary in said 

5 mobile unit, recognizing said first voice commands in said digital signal processor and 

6 controlling said mobile unit in response to said first voice commands; 

7 receiving second voice commands in said mobile unit, converting 

8 said second voice commands to digital data signals in said mobile unit and sending said 

9 digital data signals to said computer using said radio module; and 

1 0 operating said computer to recognize said second voice commands 

1 1 using a large vocabulary voice recognition program to derive computer control signals 

12 therefrom. 

1 2. A method as specified in claim 1 wherein said controlling said 

2 mobile unit in response to said first voice commands comprises controlling said mobile 

3 unit to communicate with said computer. 

1 3. A method as specified in claim 1 further including operating said 

2 computer in response to said computer control signals. 
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1 4. A method as specified in claim 3 wherein said mobile unit further 

2 includes a speaker, and wherein said operating said computer comprises retrieving data 

3 from a memory of said computer, converting said retrieved data into voice data, sending 

4 said voice data to said mobile unit, and converting said voice data to analog signals to be 

5 supplied to said speaker. 

1 5. A method as specified in claim 1 wherein said mobile unit is 

2 provided with a speaker, and wherein said computer control signals are arranged to 

3 establish a voice communications channel between said mobile unit and at least one other 

4 voice communicating device, and wherein said computer is operated to establish said 

5 voice communications channel to transfer voice communication data between said mobile 

6 unit and said other voice communications device. 

1 6. A method as specified in claim 5 wherein establishing said voice 

2 communications channel includes converting said voice communications data between 

3 digital and analog form. 

1 7. A mobile device, comprising: 

2 a microphone for receiving sound signals; 

3 an interface, connected to said microphone for converting received 

4 sound signals from said microphone to data signals; 
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5 a radio module for sending wireless data communication signals; 

6 and 

7 a digital signal processor, said processor including a program for 

8 (1) recognizing a limited number of digital data signals from said interface and operating 

9 in response thereto to control said radio, (2) operating said radio module to send digital 

10 data signals, and (3) providing digital data signals corresponding to sounds from said 

1 1 microphone as data packets to said radio module. 

1 8. A mobile device as specified in claim 7 wherein said digital 

2 processor is further programmed to compress said digital data signals corresponding to 

3 sounds. 

1 9. A mobile device as specified in claim 7 further comprising a 

2 speaker, wherein said interface is further arranged for converting digital data signals into 

3 sound signals and providing said sound signals to said speaker, and wherein said digital 

4 signal processor is further programmed to provide digital data signals received by said 

5 radio to said interface for conversion to sound signals. 

1 10. A mobile device as specified in claim 9 wherein said digital 

2 processor is further programmed to compress said digital data signals from said interface 

3 and to decompress digital data signals received by said radio. 
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1 1 1 . A mobile device as specified in claim 10 wherein said digital 

2 processor controls said interface to have a first sampling rate for voice signals data 

3 corresponding to commands and to have a second sampling rate for voice signals 

4 corresponding to voice communications. 

1 12. A mobile device as specified in claim 9 wherein said digital 

2 processor is interfaced to a host processor for transferring data signals to be sent or 

3 received using said radio module. 

1 13. A mobile device as specified in claim 9 wherein said digital 

2 processor is interfaced to a bar code scanner for receiving bar code signals from said 

3 scanner, and wherein said digital processor is further programmed to convert said bar 

4 code signals to digital data signals. 

1 14. A mobile device as specified in claim 9 wherein said digital 

2 processor is programmed to receive data signals from said interface and to alternately 

3 supply said data signals to first and second buffer memories during alternating first and 

4 second time intervals by direct memory access, and wherein said processor is 

5 programmed to process data in one of said data buffers while said data signals are 

6 supplied to the other of said data buffers. 
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1 15. A mobile device as specified in claim 14 wherein said processor is 

2 programmed to process sai d data using a compression algorithm. 
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ABSTRACT OF THE DISCLOSURE 

A mobile device is arranged to receive first voice commands to be 
interpreted by a digital signal processor in said device having a limited vocabulary voice 
recognition program and to receive second voice commands which are converted to voice 
representative data signals to be sent by a WLAN to a remote computer for interpretation 
using a large vocabulary voice recognition program. The mobile device provides voice 
control of the remote computer and can also provide voice activated voice 
communications. 
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